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ABSTRACT 

Context. One of the biggest challenges facing large transit surveys is the elimination of false-positives from the vast number of transit 
candidates. A large amount of expensive follow-up time is spent on verifying the nature of these systems. 

Aims. We investigate to what extent information from the lightcurves can identify blend scenarios and eliminate them as planet 
candidates, to significantly decrease the amount of follow-up observing time required to identify the true exoplanet systems. 
Methods. If a lightcurve has a sufficiently high signal-to-noise ratio, a distinction can be made between the lightcurve of a stellar 
binary blended with a third star and the lightcurve of a transiting exoplanet system. We first simulate lightcurves of stellar blends and 
transiting planet systems to determine what signal-to-noise level is required to make the distinction between blended and non-blended 
systems as function of transit depth and impact parameter. Subsequently we test our method on real data from the first IRaOl field 
observed by the CoRoT satellite, concentrating on the 5 1 candidates already identified by the CoRoT team. 

Results. Our simulations show that blend scenarios can be constrained for transiting systems at low impact parameters. At high 
impact parameter, blended and non-blended systems are indistinguishable from each other because they both produce V-shaped 
transits. About 70% of the planet candidates in the CoRoT IRaOl field are best fit with an impact parameter of b >0.85, while less 
than 15% are expected in this range considering random orbital inclinations. By applying a cut at b < 0.85, meaning that ~15% of 
the potential planet population would be missed, the candidate sample decreases from 41 to 11. The lightcurves of 6 of those are 
best fit with such low host star densities that the planet-to-star size ratii imply unrealistic planet radii of R > 2R Jup . Two of the five 
remaining systems, CoRoTlb and CoRoT4b, have been identified as planets by the CoRoT team, for which the lightcurves alone rule 
out blended light at 14% (2cr) and 31% {2a). One system possesses a M-dwarf secondary, one a candidate Neptune. 
Conclusions. We show that in the first CoRoT field, IRaOl, 85% of the planet candidates can be rejected from the lightcurves alone, 
if a cut in impact parameter of b < 0.85 is applied, at the cost of a < 15% loss in planet yield. We propose to use this method on the 
Kepler database to study the fraction of real planets and to potentially increase the efficiency of follow-up. 

Key words. 



1. Introduction 

With the CoRoT and Kepler space observatories in full swing 
(Baglin et al. 2006, Borucki et al. 2003), which both deliver 
thousands of lightcurves with unprecedented photometric pre- 
cision and cadence, we have moved into an exciting new era 
of exoplanet research. Now, the characterisation of small, possi- 
bly rocky planets has finally become a realistic prospective (e.g. 
Corot-7b, Leger et al. 2009; Kepler-lOb, Batalha et al. 2011). 
One of the biggest challenges is to seperate real planets from 
the significant fraction of (astrophysical) false-positives that can 
mimic a genuine transit signal (e.g. Batalha et al. 2010). Ground- 
based transit surveys have revealed that stellar eclipsing bina- 
ries (EBs) blended with light from a third star are the main 
source of contamination (e.g. Udalski et al. 2002). Also, for 
Super-Earth planet candidates blends with a background tran- 
siting Jupiter-sized planet system can be important. In these sys- 
tems the eclipse depth, shape and ellipsoidal light variations of 
an EB are diluted by the effects of chance alignment of a fore- 
ground or background star or associated companion inside a 
photometric aperture set by either the pixel scale or the point 
spread function. In addition, light from a third star in the pho- 
tometric aperture can bias the fitted parameters of a planet tran- 
sit system. High resolution, high signal-to-noise spectra are nor- 
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mally required to exclude binary scenarios by excluding their 
large radial velocity orbi-sector variations, a process that can be 
very time-consuming. 

Stellar blends are common in space-based transit surveys as 
apertures are relatively large (e.g. 19"x21" for CoRoT), and tar- 
get fields are crowded since the number of target stars is maxi- 
mized in this way. To weed out false-positives, the CoRoT team 
relies on an extensive ground-based follow-up campaign for on- 
off photometry to identify the transited star in the CoRoT aper- 
ture (Deeg et al. 2009) and high resolution imaging observa- 
tions to identify possible stars that dilute the lightcurve of a 
planet candidate. Even so, many candidates remain unresolved 
and defy easy characterisation after such a campaign. Kepler 
uses its unique astrometric precision to minimise the number of 
blends, which can be identified by a position shift of the flux cen- 
troid during transit, but will still require enormous ground-based 
efforts on the remaining ~1200 candidates (e.g. Borucki et al. 
2011). Together with the new influx of planet candidates from 
current surveys, possible future missions (such as PLATO; e.g. 
Catala et al. 201 1) and ground-based efforts to hunt for planets 
around low-mass stars, the telescope demand for full follow-up 
may grow enormously. Therefore, any new technique or strat- 
egy that can eliminate even a moderate fraction of all candidates 
from the discovery lightcurves, prior to follow-up, is extremely 
valuable. 
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In this paper we investigate to what extent information from 
the lightcurves themselves can identify blend scenarios and 
eliminate them as planet candidates and on the other hand rule 
out blend scenarios in the case of true planet systems. Our key 
motivation is that the lightcurves of blended systems can not be 
perfectly fit by pure transit models and neither can genuine tran- 
sits befit by blended light models. In section 2 we introduce our 
lightcurve fitting procedure and in section 3 we apply it to simu- 
lated data of a transiting hot Jupiter and Super-Earth. While such 
a procedure provides a natural tool to distinguish blends from 
genuine planetary systems by lightcurve fitting, it breaks down 
for transits with high impact parameters. We therefore only con- 
sider transiting systems with impact parameter b < 0.85, loosing 
potentially ~15% of the planet catch, but significantly decreas- 
ing (by an order of magnitude) the required amount of follow-up 
observations. In section 4 we apply our method to the candidates 
of the CoRoT IRaOl field, whose candidates are almost com- 
pletely characterised through an extensive follow-up campaign, 
and discuss the results in section 5. 



2. Method 

2.1. Transit fitting 

Several methods have been presented in the literature to identify 
blended systems and to select the best planet candidates. Seager 
& Mallen-Ornellas (2003) proposed a diagnostic that involves 
fitting a trapezoid to the transit lightcurve to obtain estimates for 
the transit parameters and subsequently identify the best candi- 
dates. In this paper we use a method very similar to that used 
by Snellen et al. (2009) to reject blend scenarios for the tran- 
siting hot Jupiter OGLE2-TR-L9. It involves least-square fitting 
of a lightcurve using the standard transit parameters (see below) 
plus an additional parameter representing the extra light from 
a third light source. If the fit is significantly better with extra 
light, the lightcurve is from a blended system. If this is not the 
case, an upper limit to the third light fraction can be set to a 
degree depending on the signal-to-noise of the data. This pro- 
cedure is in essence similar to Blender, which is used by the 
Kepler team (e.g. Torres et al. 2011). However, Blender simu- 
lates physical systems involving so many parameters that it is 
impractical to run on a large number of candidates. Here we are 
not interested in the true nature of the second object (whether it 
is a background, foreground or physically related star), just in its 
possible influence on the transit lightcurve. 

We assume at this point that lightcurves with obvious signs 
of the presence of a stellar binary, such as ellipsoidal light vari- 
ations and/or secondary eclipses, have been excluded from the 
candidate list. Note that a useful upper limit to the amount of 
ellipsoidal light variation, and the likelihood of a genuine plan- 
etary secondary, can be obtained by taking a Fourier transform 
of the data with the transit signal removed. We therefore do not 
require EBOP (Popper and Etzel 1981) to model the complex 
binary effects in the lightcurve, but rather utilize an IDL rou- 
tine that incorporates the analytical transit model of Mandel and 
Agol (2002;M&A). Our system simply consists of a secondary 
transiting a primary with possible additional light from a tertiary. 



2.2. Transit parameters 

We treat the transit mid-time To and the orbital period P as fixed 
parameters, resulting from the candidate selection process. For 
extra simplicity we keep the limb darkening parameters fixed 
at the tabulated solar values for CoRoT white light, assuming 



quadratic parameters (a,b)=(0.44,0.23) from Sing et al. (2010). 
Although this gives a small bias (<0.06 in impact parameter) for 
primary stars of different stellar type, the method is not meant for 
precise planet characterization and does not influence the char- 
acterization of potential blended and non-blended systems. Our 
transit model has three free parameters; the ratio of secondary 
over primary radii (R2/R\), the impact parameter of the transit b, 
which is the smallest projected distance of the centre of the sec- 
ondary to that of the primary in units of R\, and the density of 
the primary star p\. This density can be converted to the scaled 
orbital radius (a//?i), assuming that Mi»M2, through 
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The relative projected distances z between secondary and pri- 
mary are computed from the input orbital phases (/>, 



z(<p)= A/l^-l sin(0 + b 2 cos((f>) 2 , 



(2) 



Together with (R2/R\), these are used as input to a custom-made 
IDL program, incorporating the routine from M&A, that com- 
putes the theoretical models. We introduce light to this transit 
system by adding the blended light fraction k, 



F totalis, b,R 1 /R 2 ,p»,k) = F edipse -(l-k)+k, 



(3) 



where F ec u pse is the original transit lightcurve. We then devise 
the following chi-square statistic to compare the lightcurve to 
the data F i, s j with uncertainty cr ois ,-, 



X 



(4) 



obs,i 



Note that we assume circular orbits. This has no influence on the 
characterization of blended and non-blended systems, but it does 
affect the derived host star density, and is therefore important for 
the estimate of the radius of the secondary object. This is further 
discussed in section 5. 



2.3. MCMC 

To obtain the best-matching system parameters, we use a Monte 
Carlo Markov Chain x 2 optimisation technique (MCMC, e.g. 
Tegmark et al. 1998) to map out the probability distribution for 
each lightcurve parameter. MCMC is found to be a more ro- 
bust technique to obtain a global parameter solution in multi- 
parameter space than (downhill) grid-based methods, due to the 
resolution inefficiency of the latter (e.g. Serra et al. 2011). In 
the MCMC algorithm, the parameters p t are perturbed by an 
amount drawn from a normal distribution N according to: p i+ \ = 
Pi+ f ■ N ■ o-p, where / is the jump function and cr p the standard 
deviation of the sampling distribution for each p. Subsequently 
X 1 is recalculated for these perturbed parameters and a Gaussian 
likelyhood X a exp(~x 2 /2) is determined. These random jumps 
in parameter space are accepted or rejected according to the 
Metropolis-Hastings rule (Metropolis et al. 1953;Hastings 1970) 
. If the perturbed parameter set has a higher likelyhood X' than 
its progenitor, it will be accepted as a new chain point, otherwise 
it will be accepted with a probability of £,'/£,. We run the algo- 
rithm many times to build up a 'chain' of parameter values and 
tweak cr p and / such that ~40% of the jumps are accepted. After 
creating multiple chains from different starting conditions, we 
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check proper model convergence and mixing of the individual 
chains using the Gelman & Rubin R statistic (Gelman & Rubin 
1992). To save time, first k is set to zero at the minimum^ 2 de- 
termined with MCMC analysis. Subsequently k is increased in 
small steps (but always kept fixed during the MCMC) with the 
previously found parameters as starting values. In this way the 
parameter values (adopting the median of the distribution) and 
the uncertainties in the parameters are determined as function of 
k in an efficient way. 



3. Tests on synthetic lightcurves 

In this section, we test our method on synthetic lightcurves to 
determine the required precision to detect or exclude third light 
in a particular transit system. We perform these simulations for 
two candidate systems: (i) a hot Jupiter orbiting a solar type star 
and (ii) a Super-Earth around a similar host. 

3.1. Transiting hot Jupiter 

We simulated a set of transit lightcurves for a hot Jupiter with 
R 2 = lRjup and P=2.5 days, orbiting a star with a solar den- 
sity, for a range of impact parameters. The lightcurve for an im- 
pact parameter of b — 0.2 is shown in FigureQ] As explained in 
the previous section, our method finds the best fit for a range in 
blended light fraction k. Of course, in this simulation a perfect 
fit is obtained for k-Q. As can be seen in Figure [1] an increas- 
ingly worse fit is obtained for increasing k, most obviously seen 
by comparing the £=0.95 model to the synthetic data. This latter 
model fit assumes that 95% of the light is from a third object, 
meaning that the unblended transit is actually a factor 20 deeper, 
hence 20% deep instead of 1%. It implies that R 2 /Ri ~ 0.45, 
resulting in a much longer duration transit unless it is grazing. 
This results in the best-fitting £=0.95 model being much more 
V-shaped than the synthetic lightcurve of the planet. We can 
now convert the differences between the synthetic lightcurves 
and model fits to x 1 values for each combination of b and k by 
assigning uncertainties to the synthetic data. In this way we can 
determine what photometric precision is required to exclude a 
certain blended light fraction in the lightcurves as a function of 
b. Figure [2] shows the precision per 5 minutes of data required 
to exclude a blended light fraction k at a 3cr level in a system 
with an impact parameter of b=0.2, 0.5, 0.8, and 0.95. The re- 
quired precision becomes more stringent for lower values of k 
and higher values of b. For b=0.2, 80% blended light (£=0.8) 
can be excluded in a lightcurve with a precision of only 2 x 10 -3 
per 5 minutes, while for b=0.8, 20% of blended light can only 
be rejected if the lightcurve has a precision of 4 x 10~ 5 per 5 
minutes. 

3.2. Transiting super-Earth 

We performed also tests on a Super-Earth with R 2 = 2.5R® or- 
biting a sun-like star, following the same procedure as described 
above. Since the transit itself is a factor ~ 16 more shallow than 
for a Jupiter-size planet, the level of precision required to re- 
ject blend scenarios is also significantly higher, as can be seen in 
Figure [3] Note however that even for a blended light fraction of 
£=0.95, the radius of transiting object R 2 is still in the Jupiter- 
size regime. Hence only if the blended light fraction is very high, 
k > 0.95, can an eclipsing binary mimic a Super-Earth transit. 
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Fig. 1: Simulated lightcurve for a transiting exoplanet system consist- 
ing of a hot Jupiter in a 2.5 day orbit around a solar type star with 
impact parameter b=0.2 (black dots). The solid curves show diluted bi- 
nary models with best-fit parameters determined by MCMC, for blended 
light fraction k=[0.2, 0.5, 0.8, 0.95]. 
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Fig. 2: The photometric precision per 5 minutes required to exclude a 
blended light fraction k at 3o~ for a hot Jupiter around a solar type host 
star (R2/R1 = 0. 1 and p, = p Q ), as a function of the system parameters 
b and k. The four solid curves are for impact parameters b=0.2, 0.5, 
0.8, and 0.95. The upper and lower horizontal dotted lines indicate the 
range in precision for objects in the IRaOl field, determined by Ai grain 
et al. (2009). 

4. Tests on candidates in the CoRoT IRa01 field 

4.1. The data set 

In this section we test our method on real data, using the 
lightcurves of the candidates selected by the CoRoT team from 
C0R0T field IRaOl (Caipano et al. 2009). In this first field tar- 
geted by CoRoT, 3898 bright stars were observed in chromatic 
mode (with a blue, green and red channel) and another 5974 in 
a single monochromatic "white" band in a 66 day staring run 
towards the Galactic anti-center. From the 50 initial candidates, 
a subsample of 29 promising targets received extensive follow- 
up as discussed in Moutou et al. (2009). Two of these have so 
far been identified as genuine planets: CoRoT-lb, a low density 
R p = 1 A9R j up transiting hot Jupiter around a GOV host (Barge 
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Fig. 3: As for Fig. [2] but for a 2.5R Eart i, SuperEarth planet around a 
solar type host in a 2.5 day orbital. We can exclude 80% blended light 
at the 3o~ level at a moderate impact parameter ofb=0.5. The horizontal 
dashed line refers to the precision reached in the discovery lightcurve 
ofCoRoT 7b, the first rocky SuperEarth planet (Leger et al. 2009). 



et al. 2008) and CoRoT-4b, a R p = 1.19/? Jup hot Jupiter around 
a F8V host (Aigrain et al. 2008). Seventeen additional systems 
were solved using the photometric and spectroscopic follow-up 
observations (Moutou et al. 2009). We choose to test our method 
on the 45 bright candidates with more than one transit observed, 
using the publicly available N2-level data. 

4.2. Pre-cleaning of the lightcurves 

We first combine the multicolor lightcurves into one single 
'white lightcurve' for each candidate under the assumption that 
the CoRoT analysis teams did not detect any significant vari- 
ation of eclipse depth with wavelength, which would already 
have been a clear sign of blending effects. We first clip each 
lightcurve by removing outliers at the 5<x level. These outliers 
are mostly associated with the epochs at which the satellite 
passes the South Atlantic Anommaly (SAA) or moves in/out of 
the Earth's shadow. We then iteratively refine the mid-times To 
and the orbital period P using the Kwee-van Woerden method 
(Kwee & van Woerden 1956) and cross-correlation with a the- 
oretical transit model (e.g. Rauer et al. 2009). Individual transit 
events that show temporary jumps in flux, caused by the impact 
of energetic particles (mainly protons) onto the CCD ("hot pix- 
els"), are excluded from our analysis. For 16 out of the initial 50 
CoRoT IRaOl candidates (32%) we had to remove one or more 
transits from the lightcurve that were affected by such particle 
hits. Each individual lightcurve was then phasefolded around 
every transit. To normalise the data, we fit either a first order 
polynomial in a small range in phase (+0.1 from mid-transit) 
around each transit or a higher order polynomial (order «=13) 
in a larger phase range (typically +0.4 in phase), depending on 
which approach delivers the lowest rms in and out of eclipse and 
the least red noise (Pont et al. 2006). Figure|4]shows a typical ex- 
ample of the dominant frequencies still remaining after the poly- 
nomial fit. For most objects, distinct peaks exist around periods 
of 103 minutes and at 24 hours. We identify these peaks with 
remaining systematics, related to the satellite's orbit and Earth's 
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Fig. 4: Fourier diagram of an example noise spectrum prior to 
lightcurve cleaning. Amplitude of the best-fitting sine curve on the ver- 
tical axis is plotted against frequency. Peaks around frequencies of 1.0 
and ~14 are due to remaining systematics related to the satellite orbit 
and Earth 's rotational period. 
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Fig. 5: The MCMC solution for all IRaOl candidates in (b,(R p /R,)) 
space. Note the strong parameter degeneracy at high b. Yellow CoRoT 
WinlDs are the two confirmed planets CoRoT-lb and CoRoT-4b, blue 
objects are confirmed blends from the follow-up work presented in 
Moutou et al. (2009), and red sources are either unsettled cases or con- 
firmed genuine binaries with non-planetary secondary masses from the 
radial velocity variations. 



rotational period, caused by ingress and egress of the spacecraft 
from Earth's shadow, variations in gravity and magnetic field 
and changes in the levels of thermal and reflected light from the 
Earth (e.g. Aigrain et al. 2009). By folding the out-of-eclipse 
data onto the dominant frequencies of the Fourier diagram, we 
then fit a sinusoidal function to the remaining systematics, fol- 
lowed by median averaging over all transits. We subsequently 
binned the lightcurves and assign errors, according to the stan- 
dard deviation divided by the square root of the number of points 
in each bin. 
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4.3. Fitting the lightcurves 

Each lightcurve is first fitted with the method explained in 
section 2, assuming k=0, yielding the starting parameter sets 
(R2/R\,b,p*) for our blend analysis. In Figure |5]we show the re- 
sulting MCMC distribution of impact parameter b versus R2/R1 
for all the 45 candidates. C0R0T WinlDs (a shortcut of the 
C0R0T run identification number, e.g. IRaOl-E 1-2046) for each 
candidate are indicated, with yellow for the two confirmed plan- 
ets CoRoTlb and CoRoT4b, in blue those candidates that have 
been confirmed to be blended systems by Moutou et al. (2009), 
and in red unsettled cases (either suspected early type stars with 
only few or very broad spectral lines for further radial velocity 
follow-up observations with HARPS or confirmed genuine EBs 
with non-planetary secondary masses). As can be seen, a large 
fraction of the candidates are, assuming no blended light, best fit- 
ted with a very high (often larger than unity) impact parameter. 
This is even more clear in the distribution of fitted impact pa- 
rameters as shown in figure[6] For 32 out of the 45 (~ 70%) can- 
didates b > 0.85, while from geometric arguments it is expected 
that ~ 15% of planets would be found at such a high impact 
parameter. Assuming that all eleven candidates at b < 0.85 are 
non-blended systems only ~1.6 objects are expected at b > 0.85. 
Since our tests in section 3 have shown that it is very difficult to 
distinguish blends from non-blended systems at high impact pa- 
rameters due to their V-shaped lightcurves, we apply a cut in the 
candidate list at b < 0.85, knowing that we will potentially re- 
move only a small fraction of the planet yield, in the case of the 
CoRoTaOl field < 0.3 planets. From this it can be seen that it is 
highly likely that all candidates with b > 0.85 are blended and/or 
grazing eclipsing binaries. For the eleven remaining candidates 
we used the transit parameters from the k=0 model to refit the 
lightcurve with an increasing value of k, as outlined in section 3. 
In this way we redetermine the best fit solution and^ 2 as a func- 
tion of k. As an example we show the best fit transit models for 
a range of k and the x 1 as function of k for candidate El -4617 
in Figure Q As can be seen, the lightcurve can only be well fit- 
ted by models with a low k. E.g. the^- 2 of the best fitting A:=0.5 
model is ~ 40% higher than that for k-Q. The 2 sigma upper 
limit for the fraction of blended light (A^ 2 ) is £=0.20. We per- 
formed this same analysis for all eleven remaining candidates for 
which the x 2 versus k plots are shown in the Appendix, together 
with their best fit lightcurves. None of these candidates are better 
fitted by a high k model than a low k model, indicating that all 
blended systems have moved out of the remaining sample since 
they are all fitted with a high impact parameter. For six objects 
a significant fraction of blended light can be excluded from the 
lightcurve alone, including CoRoT-lb and CoRoT-4b. It would 
therefore not have been necessary to check whether the variabil- 
ity in these candidates came from the target star or not and the 
follow-up could have immediately concentrated on radial veloc- 
ity measurements. 

All parameters of the remaining candidates are shown in 
Table 1. An additional cut in the candidate list is made us- 
ing a combination of the best fit mean stellar densities p, and 
R2/R1, as shown in Figure [8] Six of the candidates have host 
stars with densities corresponding to A-stars, resulting in unre- 
alisticly large secondary radii of > 2.QRj up . Note that there is 
currently no consensus on the upper limit of planet size, mean- 
ing that by setting a hard limit on planet radius we may exclude 
very large or bloated (hot) Jupiters. However, there are currently 
only 4 out of 219 transiting exoplanets reported with radii larger 
than l.ERj U p (www.exoplanet.eu). Also, the probability that the 
secondary is a mid-type M-dwarf rather than a genuine planet 



increases when considering larger radii. This results in a remain- 
ing planet candidate sample of 5 objects instead of the original 
45 using arguments based on the lightcurve alone. These five ob- 
jects have been marked with filled symbols in Figure [8] Details 
on each system are discussed in Appendix A. 
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Fig. 6: Distribution of fitted impact parameter of the C0R0T IRaOl can- 
didates. The distribution is strongly peaked around b=1.0, indicating 
a significant population of (blended) EB contaminants. For a genuine 
planet distribution we would expect aflat histogram that falls off at high 
impact parameter. The dotted vertical line indicates the b=0.85 cutoff 
we have proposed in this paper. 
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(R2/R1) versus p„ 
4617 0102753331 
2430 0102815260 
4073 0102863810 
1736 0102855534 
3724 0102759638 



19.76 
3.59 
15.00 
21.72 
12.33 



0.19 
0.10 
0.18 
0.11 
0.10 



0.10 
0.24 
0.36 
0.43 
0.50 



-1.42 
-0.81 
-0.08 
-1.24 
-1.33 



10.47 
5.36 
24.40 
12.77 
8.17 



20% 
44% 
67% 
62% 
78% 



Table 1 : The candidate sample that survives the impact param- 
eter cut. The last six sources are excluded using a second cut 
because the fitted host star density indicates a secondary radius 
R2 > 2R j up . The last column indicates the 2o~ upper limit to the 
blended light fraction k. 



5. Discussion 

In this paper we investigated to what extent we can use the high 
signal-to-noise lightcurves of space-based exoplanet transit sur- 
veys to identify blended light scenarios, and eliminate them as 
planet candidates. We concentrated on the 51 exoplanet candi- 
dates from the first C0R0T IRaOl field (Carpano et al. 2009). 
About 70% of the 51 planet candidates in the C0R0T IRaOl 
field are best fit with an impact parameter of b > 0.85, which 
at face value already indicates that the candidate distribution is 
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R 2 /Ri vs Host star density 
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Fig. 7: Panel a): The reduced x 1 as function of blended light fraction k. 
The horizontal dashed line shows the 2,3 and 5cr rejection criteria. Our 
lightcurve model directly indicates an early type main sequence stel- 
lar host, with a 2cr upper limit for blended light of k ~ 20%. The low 
stellar density implies a large secondary radius, rejecting the planet 
hypothesis. Panel b): best fitting EB models with blended light frac- 
tion k=[0.2, 0.5, 0.90,0.95], clearly showing that solutions with low k 
are favoured. Note that an orbit with an eccentricity of e-0.5, orien- 
tated in the right way, could increase the estimated stellar density to 
that of the Sun, and decrease R2 to 2 R Jup . This ambiguity can be easily 
removed by taking a single spectrum of the star, resolving its spectral 
type. 



strongly contaminated by blended and/or grazing systems. We 
find that by cutting a candidate sample such that those objects 
with high impact parameter are removed, at the cost of losing 
a small fraction of potential planets, a significant reduction in 
required follow-up observations can be achieved. Of all candi- 
dates, only 5 remain in the final sample of which two are genuine 
planet systems, one is a low mass transiting M dwarf and one is 
a candidate Neptune. 

The V-shaped lightcurves of near-grazing planet systems are 
strongly degenerate with blended eclipsing binary systems and 
can therefore not be distinguished from each other. How many 
planets are potentially missed by invoking the cut in impact pa- 
rameter? Of the known transiting exoplanets, ~6% has an im- 
pact parameter larger than 0.85 and ~16% an impact parameter 




Fig. 8: The R2/R1 size ratio versus the log of the stellar density for 
the C0R0T candidates in the IRaOl field, assuming k=0. The dotted 
lines mark the densities of A to M type main-sequence hosts. The five 
filled dots are the candidates that survive both our cuts in impact pa- 
rameter and secondary size. The two confirmed transiting hot Jupiters 
CoRoTlb and CoRoT4b are shown as yellow filled dots. Open circles 
are the candidates we have excluded using our cuts. Blue circles indi- 
cate sources which have been identified as blended EBs by the C0R0T 
team follow-up, and red circles are either unsettled cases in the C0R0T 
follow-up or systems identified as genuine EBs through their radial ve- 
locities. The four solid curves indicate R 2 =[0.10,0.50,l. 0,2. 0]R Jup , as- 
suming the main sequence mass-radius relation of Cox (2000) for the 
primary. 



of more than 0.75Q. The cumulative probability of a particular 
transit at a given impact parameter greater or equal to a cutoff 
value bx and transit depth AF is given by: 



Pc(b > b x ) 



1 + VaF - b x 1 + R2/R1 



1 + yAF 



1 +R2/R1 



(5) 



Note that this expression is different from the equation presented 
in Seager and Mallen-Ornellas (2003), because the maximum 
impact parameter in their formula is determined by the grazing 
condition b max — 1 - R2/R1, yielding a minus sign in equation 
5. For a 1Rj„ p planet around a solar type star ~22% would po- 
tentially be missed by setting the cut in impact parameter (~6% 
according to Seager and Mallen-Ornellas). However, extremely 
grazing systems will be very shallow and of short duration and 
will therefore provide very limited physical information. For ex- 
ample, a grazing lRj up with impact parameter b =1.05, will 
show a transit with a duration of 30% and only 20% of the depth 
of a transit with b = 0. Therefore, the actual planet loss fraction 
will be closer to the predictions of Seager and Mallen-Ornellas 
(2003), i.e. < 15%. 

In this paper we have made the assumption of circular or- 
bits, but radial velocity surveys teach us that such an assumption 
is not valid for longer periods (e.g. Butler et al. 2006). In addi- 
tion, Barnes (2007) shows that a planet with an eccentric orbit is 
more likely to transit by a factor of (1 - e 2 )~ l than a planet in a 
circular orbit with the same semi-major axis. A significant pop- 
ulation of transiting exoplanets with an eccentric orbit is there- 
fore expected for long duration space-based surveys. Because 

the planet orbital velocity varies from -Jj^Vdrc to ^j-^Vcirc 
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between periastron and apastron in an eccentric orbit, transit du- 
ration can vary as function of e and to (the angle of pericenter). 
This leads to a wrong fit of the host star density (e.g. Kipping 
2010a, Tingley et al. 2011), therefore directly affecting our es- 
timate of the secondary radius R2- We therefore can not reliable 
make the planet-to-star ratio versus host star density cut in the 
eccentric orbit case for longer period planets (P > 3.0days). 
Fortunately, the fitted impact parameter, R2/R1 and blended light 
fraction k are not affected by an eccentric orbit. This means that 
we can still first apply a cut in impact parameter b < 0.85 and 
remove likely blends. To subsequently determine the real host 
star density it is sufficient to take a single high-resolution spec- 
trum to determine p\ and estimate R2- Using this spectroscop- 
ically determined density an upper limit to ecos(u>) can be set. 
One particular case in our sample is CoRoT-4617 with an orbital 
period of P=19.76 days. Assuming a circular orbit, the host star 
is estimated to have a density only ~4% of that of the Sun, in 
accordance with an early B-star. This would imply that the ra- 
dius of the secondary object has R2~8R./„ ; ,. However, an orbit 
with an eccentricity of e=0.5, orientated in the right way, could 
increase the estimated stellar density to that of the Sun, and de- 
crease R2 to 2 Rj up . This ambiguity can be easily removed by 
taking a single spectrum of the star, resolving its spectral type. 

The method presented here is designed to remove false- 
positives in candidate lists through the identification of blended 
light. We do not assign a likelihood of planetary nature to the 
remaining candidates, meaning that we do not assess whether 
these are genuine planet systems, we just removed those sys- 
tems which are not (except for a small fraction of collateral dam- 
age). However, it is anyway interesting to link blended light frac- 
tions to the population of random background eclipsing binaries. 
Assuming that 1:300 of field stars are eclipsing binaries (Devor 
et al. 2008), and 1:1000 stars have a transiting hot Jupiter, we 
require an average of 0.3 background stars within the PSF, and 
within the magnitude range set by the limit of blended light, to 
have an equal likelihood for the two scenarios, and to end up 
with half of the remaining objects as eclipsing binaries. For a 
typical magnitude (V=14.0) for the candidate star, taking into 
account the size of the C0R0T PSF, this is reached at a Amag 
= ~1.5, corresponding to k=0.8. For 8 out of the 10 remaining 
targets this level of blended light is excluded at the >3<x level. 
For a typical limit of k<0.6, the background eclipsing binary 
can at most be 0.5 magnitudes fainter than the target star, mak- 
ing this scenario a factor ~5 less likely. Do note however that 
this does not take into account physical triple systems, for which 
radial velocity follow-up is required to exclude them. Recently, 
the Kepler team have announced the discovery of ~1200 planet 
candidates (Burucki et al. 201 1). We propose to use the method 
presented here on this candidate list, to identify clear blend sys- 
tems to obtain a better estimate on the fraction of planet systems 
in this sample. 

6. Conclusions 

In this paper we have investigated to what extent informa- 
tion from lightcurves of a space-based exoplanet transit sur- 
vey can identify blended light scenarios and eliminate them as 
planet candidates, to significantly decrease the required amount 
of follow-up time. If a lightcurve has sufficiently high signal- 
to-noise, a distinction can be made between a blended eclips- 
ing binary and a transiting exoplanet. We first have simulated 
lightcurves of stellar blends and transiting planet systems to 
determine the required signal-to-noise as a function of impact 
parameter and transit depth. Our simulations show that blend 



scenarios can be distinguished from transiting systems at low 
impact parameter. At high impact parameter, blended and non- 
blended systems both produce V-shaped transits and are indis- 
tinguishable from each other. We have subsequently tested our 
method on real data from the first IRaOl field of C0R0T, con- 
centrating on the 51 candidates already identified by the C0R0T 
team (Carpano et al. 2009). We show that 70% of the planet can- 
didates in the C0R0T IRaOl field are best fit with an impact pa- 
rameter of b > 0.85, whereas ~15% are expected assuming ran- 
dom orbital orientations. By applying a cut at b < 0.85, meaning 
that ~15% of the potential planet population would be missed, 
the candidate sample decreases from 41 to 11. The lightcurves 
of 6 of those are best fit with such a low host star density that 
the planet-to-star size ratio implies an unrealistic planet radius 
of R2 > 2Rj llp . From the remaining five, two systems, C0R0T- 
lb and CoRoT-4b, have been identified by the C0R0T team as 
planets, for which the lightcurves alone rule out blended light at 
a 14%(2<x) and 31%(2cr). One other candidate is also consistent 
with a non-blended system, but is a late M-dwarf, which will 
always require radial velocity follow-up for confirmation since 
M-dwarfs can have similar radii as Jupiter mass planets. One 
other system consists of a candidate Neptune around a M-dwarf 
according to Moutou et al. (2009). We have therefore shown that 
85% of the planet candidates can be rejected for the IRaOl field 
from the lightcurves alone. We propose to use this method on 
the Kepler database to study the fraction of real planets and to 
potentially increase the efficiency of follow-up. For long period 
candidates, possible non-zero eccentricity will affect the cut in 
planet-to-star ratio versus host star density, effectively increas- 
ing the sample size. However a single high-resolution spectrum 
would be sufficient to determine the real host star density and 
estimate the size of transiting objects. 
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Appendix 

In this Appendix we discuss in detail the sample of 10 remaining 
CoRoT candidates, that were selected using the cut in impact pa- 
rameter and were presented in Section 4 and Table 1. In Figures 
9-11, we show for each candidate the blended light fraction k 
versus reduced^ 2 , and the best fitting blended light models for 
k=0.2, 0.5, 0.9, and 0.95. In Table 2, we show best-matching sys- 
tem parameters for the full CoRoT IRaOl, assuming no blended 
light. 

Comments on individual sources 

SELECTED PLANET CANDIDATES FROM THE 
LIGHTCURVES ALONE 

E2- 11 26-0 1028903 18 

We find a 2<x upper limit for blended light contribution of 
k < 0.14, therefore a blend scenario can be excluded for this 
source at high confidence using the lightcurve alone. In addition, 
by assuming that the host star is on the main sequence, its mean 
density points to a ~ 1 .5Rj up radius, well in the range of known 
hot Jupiters. Of course, this source is exoplanet CoRoT-lb 
(Barge et al. 2008). 
El -0330-0102912369 

We find a 2<x upper limit for blended light contribution of 
k < 0.31 from its lightcurve, meaning that only a small con- 
tribution of blended light is tollerated. Assuming the host star 
is on the main sequence, its mean density points to a ~1.2Rj up 
radius for the secondary. This object is identified as exoplanet 
CoRoT-4b (Aigrain et al. 2008). Eventhough the CoRoT-4b 
host star is of similar brightness as CoRoT-lb, the significantly 
longer orbital period, the residual variability (caused by a 
spotted rotating stellar photosphere) and the 1.8 times smaller 
transit depth are the causes of the lower confidence on blended 
light. 

E2-0203 -0102825481 

The 2<x upper limit for blended light is A: < 0.3 from its 
lightcurve. Radial velocity follow-up by the CoRoT team 
showed this to be an eclipsing binary of a low-mass M dwarf 
and a G-type primary (Morales et al., in prep). Assuming the 
host star is on the main sequence, its mean density points to a 
~1.7Rj up radius. Although not a planet, it is consistent with a 
non-blended system as found from our lightcurve fitting. Such 
systems always require RV follow-up since late M dwarfs and 
Jupiter-mass planets can have similar radii. 
£2-77 '12-0102826302 

We find a 2cr upper limit for blended light contribution of 
k < 0.93. We can therefore only exclude a high contribution of 
blended light for this shallow (2.4 mmag) transit. This means 
that at 2cr confidence the true eclipse depth is less than 2.4% 
in the presence of blended light. The fitted host star mean 
density points to an early type or evolved system. HARPS radial 
velocity follow-up has confirmed that the host star is an evolved 
fast rotator and Moutou et al. (2009) conclude that a triple 
system is the most probable scenario. 
El -4108-0102779966 

Because of the poor signal-to-noise of this transit and the 
relatively high impact parameter b =0.8, the 2<x upper limit for 
blended light is k < 0.95, therefore only a very high contribution 
of blended light can be excluded for this candidate. Assuming 
the host star is on the main sequence, its density is slightly 
lower compared to the solar value, indicating a stellar radius of 
R\ ~ 1.27? . However, spectroscopic follow-up with HARPS 
suggested that the host is a low mass (~ 0.8M©) star. No 



additional follow-up has thusfar been obtained by the CoRoT 
team. 

CANDIDATES REJECTED DUE TO THEIR LARGE SIZE 

£7-4677-0702753337 

The 2cr upper limit for blended light is k < 0.20, therefore 
a blend scenario can be excluded at high confidence for this 
source. Assuming the host star is on the main sequence, its very 
low density points to an early B-type primary with a K dwarf 
secondary. The planet hypothesis is rejected and no additional 
follow-up is therefore required judging from the lightcurve 
alone. Note that an orbit with an eccentricity of e=0.5, orientated 
in the right way, could increase the estimated stellar density to 
that of the Sun, and decrease R2 to 2 R Jup . This ambiguity can be 
easily removed by taking a single spectrum of the star, resolving 
its spectral type. 
E2-2430-0102815260 

We find a 2cr upper limit for blended light contribution of k < 
0.44. Again, only a small contribution of blended light is toller- 
ated. Assuming the host star is on the main sequence, its mean 
density, consistent with an A type or evolved star, points to a ra- 
dius R 2 > 2.5Rj up . Radial velocity follow-up by the CoRoT team 
showed this to be a single lined eclipsing binary of a fast rotating 
host star and an early type M dwarf (Moutou et al. 2009). 
E2-4073-0102863810 

For this source, we find a 2cr upper limit for blended light of 
k < 0.67. This object shows ~4% deep eclipses around a host 
star that is ~20% less dense than the sun. This candidate was 
introduced in the original list of Carpano et al. (2009), but is not 
mentioned in the follow-up paper of Moutou et al. (2009). With 
an anticipated secondary radius of ~2.\Rj up this object could 
still belong to the rare group of low mass stars or brown dwarfs. 
In the case of a stellar M5 secondary, the secondary eclipse 
would be detectable at ~3.5 mmag in depth. 
E2-1736-0102855534 

The 2cr upper limit for blended light is k < 0.62. The low mean 
density of the host star, consistent with a very early main se- 
quence or evolved star, points to a > 2.0Rj up radius. Analysis of 
the lightcurve reveals a secondary eclipse at the 9cr level, which 
indicates the secondary is in fact a low mass star. CoRoT radial 
velocity follow-up has confirmed that the host star is a fast ro- 
tating early type star and the system is a single lined eclipsing 
binary. 

E2-3724-01027 59638 

For this source, we find a 2cr upper limit for blended light of 
k < 0.78. Assuming the host star is on the main sequence, 
its very low density points to an A type primary, therefore 
7?2 > 2.0R Jup . This object is listed both as a planet candidate 
and a binary by Carpano et al. (2009). 
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Fig. 9: For each CoRoT IRaOl candidate: the blended light fraction k versus reduced x 2 (left panels) and the best fitting blended light models for k=0.2, 0.5, 0.9, 
and 0.95. 
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1531 0102/8062/ 


2.38 


0.09 


A A 1 /A AAA\ 

0.91(0.009) 


A AO /A AO A \ 

-0.68(0.020) 


/i An 
4.49 


OAAA niHOTOOATl 

2009 0102/880/3 


10.85 


0.25 


1 1 OVA /I 1 0\ 

1.17(0.432) 


1 /I /I / A A/1 C\ 

-1.44(0.045) 


£ OO 

6.88 


anT/i a 1 AOOAO A OA 

2//4 0102/98429 


1.61 


0.29 


1 1 A/A 1 \ 

1.19(0.133) 


1 TO /A AAT\ 

-1.32(0.003) 


o 1 1 
2.12 


TA1A A1AOOAA1A/" 

3010 0102800106 


23.21 


0.22 


1 AA/A 1 00\ 

1.00(0.127) 


A 1 O/A A A 1 \ 

-0.17(0.091) 


30.33 


/1TAA flimOAT/IQA 

4300 0102802430 


5.81 


0.12 


1 AA/A AOC\ 

1.00(0.025) 


1 AO /A AAA\ 

-1.02(0.006) 


6.27 


OA A /I A 1 AO OACOflO 

2604 0102805893 


3.82 


0.38 


1 / A ACO\ 

1.33(0.052) 


1 AA/A AAA \ 

-1.60(0.009) 


i n/i 

3.04 


O A /I O A 1 AO O 1 O O A 1 

2648 0102812861 


1 £0 

3.68 


n 1 n 
0.10 


A AO /A AO"A\ 

0.92(0.070) 


A OO/A A 1 A\ 

-0.82(0.010) 


C /II 

5.42 


OTOO AIAOOIAAOI 

2328 0102819021 


/I C 1 

4.51 


n 1 1 
0.12 


A AO/A ATO\ 

0.97(0.037) 


i £e~ic\ nno\ 

-1.66(0.008) 


i o /i 
3.24 


/1AAO AlAOOOIOOT 

4998 0102821773 


1 n no 

10.08 


n 1 /i 
0.14 


n oo/n n 1 i \ 

0.88(0.01 1) 


n i n/n t\cn\ 

-0.19(0.067) 


i -7 1 n 

17.19 


T/IOC A1AOOTC010 

3425 0102835817 


1 in 

1.19 


n ii 

0.32 


1 nc/n no /i \ 

1.25(0.024) 


i nno\ 

-1.65(0.008) 


1 i/i 
1.34 


TOC/I A 1 AO OA 1 AAA 

3854 0102841669 


ii/i 
1.14 


n nc 

0.05 


A A/1 /A ACA\ 

0.94(0.050) 


1 A A/A A AT \ 

-1.40(0.003) 


1 cn 

1.59 


1 ACO A 1 AO O A O 1 OA 

3952 0102842120 


13.48 


n no 

0.08 


A OC/A TCA 

0.85(0.356) 


1 A O / A A A O \ 

1.4/(0.068) 


74.27 


1 /I AO" A 1 AO O A O A C A 

140/ 0102842459 


5.17 


n oi 

0.27 


1 AO /A A1 T\ 

1.02(0.013) 


A A A /A A/1A\ 

0.49(0.040) 


1 O A C 

18.45 


00"0 1 A 1 AO O C A AO 1 

2/21 0102850921 


0.61 


0.29 


1 1 O /A A 1 0\ 

1.18(0.017) 


A AO /A AA -1 \ 

-0.97(0.004) 


1 /i £ 
1.4o 


AO" A A A 1 AO O C C A 0"0 

0/04 01028554/2 


i 1 
2. lo 


0.08 


A AO /A A/1T\ 

0.62(0.043) 


1 TO / A AAC\ 

-1.38(0.005) 


O /I c 

2.45 


/I AO" T A 1 AO 0£TO 1 A 

4073 0102863810 


15.00 


0.18 


A 1A/A AT£A 

0.36(0.036) 


A AO/A A/10\ 

-0.08(0.047) 


i/i /in 
24.40 


otoa nimo£mo£ 

2329 0102869286 


1.87 


0.13 


1 A /I / A /I T 0\ 

1.04(0.432) 


1 A 1 /A C O A\ 

-1.41(0.586) 


i 1 n 

2.19 


3336 0102876631 


1 in 

1.39 


n n/i 

0.04 


n o a tn i 1 n\ 

0.84(0.1 10) 


n ^n/n in\ 

-0.69(0.121) 


111 
3.12 


4Q1 1 0109881 83? 


9 17 


1(\ 




1.7/ (U.Ul J } 


1 57 

L.J / 


4339 0102903238 


1.36 


0.07 


1.00(0.126) 


-1.63(0.062) 


1.50 


4124 0102926194 


1.51 


0.41 


1.37(0.041) 


-1.74(0.005) 


1.47 


3819 0102932089 


1.57 


0.30 


1.07(0.035) 


-0.97(0.012) 


2.73 


4467 0102940315 


16.45 


0.19 


0.98(0.049) 


-0.86(0.010) 


14.25 


3856 0102954464 


16.56 


0.49 


1.31(0.047) 


0.55(0.347) 


42.02 



Table 2: The fitting parameters for our blend models when applied to the CoRoT IRaOl sample, assuming k=0. 
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